Welsh letter-to-sound rules: rewrite rules and two-level rules compared
Abstract
In a text-to-speech synthesis system, input words not found in the system's lexicon are passed to letter-to-sound rules, which derive the word's pronunciation. In Welsh, the letter-to-sound rules must be applied in three passes: first, to add epenthetic vowels; second, to determine stress and vowel location; and third, to perform grapheme-to-phoneme conversion. Initially, all of these letter-to-sound rules were written as context-sensitive rewrite rules and evaluated, giving a 96% success rate. The rules for the second pass were then rewritten as two-level rules, using the PCKIMMO software package. The output was identical to that produced by the second block of rewrite rules. The two-level formalism had the advantage of simplifying the rules. However, difficulties arose from the need to force the rules to operate deterministically. In a practical text-to-speech system, the rewrite rule formalism would be favoured, despite the greater number of rules and their greater clumsiness, since the critical ordering of rewrite rules easily introduces the necessary determinism.
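The point about critical ordering can be illustrated with a minimal Python sketch of an ordered rewrite-rule interpreter. This is not the rule set or software from the paper: the function name `apply_rules`, the tuple layout, and the toy rules are all assumptions made for illustration. The rules cover only a few well-known Welsh digraph correspondences and leave every other letter unchanged. At each position the first matching rule wins, so listing `ff` before `f` is what makes the result deterministic.

```python
import re

# Toy rules only (hypothetical, NOT the paper's rule set).
# Each rule: (left_context, target, right_context, output).
# Contexts are regex fragments; an empty string means "any context".
RULES = [
    ("", "ff", "", "f"),   # must precede the single-f rule
    ("", "dd", "", "ð"),
    ("", "ll", "", "ɬ"),
    ("", "ch", "", "x"),
    ("", "f", "", "v"),
]

def apply_rules(word, rules=RULES):
    """Scan left to right; at each position apply the FIRST rule that matches."""
    out = []
    i = 0
    while i < len(word):
        for left, target, right, output in rules:
            end = i + len(target)
            if (word.startswith(target, i)
                    and re.search(f"(?:{left})$", word[:i])   # left context
                    and re.match(right, word[end:])):         # right context
                out.append(output)
                i = end
                break
        else:
            # No rule matched: copy the letter through unchanged.
            out.append(word[i])
            i += 1
    return "".join(out)

print(apply_rules("ffordd"))                  # forð
print(apply_rules("ffordd", RULES[::-1]))     # vvorð (wrong: f tried before ff)
```

Reversing the rule list shows why the ordering is critical: the single-letter rule fires first and the digraph is never seen. A two-level description would instead state such constraints declaratively, without relying on rule order.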
Related resources
Re-Engineering Letter-to-Sound Rules
Using finite-state automata for the text analysis component in a text-to-speech system is problematic in several respects: the rewrite rules from which the automata are compiled are difficult to write and maintain, and the resulting automata can become very large and therefore inefficient. Converting the knowledge represented explicitly in rewrite rules into a more efficient format is difficult...
Using Rules to Improve Letter to Sound Conversion of Names
This paper presents an investigation of the use of context sensitive rewrite rules for improving the performance of data driven letter to sound conversion, concentrating on the specific case of British names. Taking a practical point of view, emphasis is put on reduction of the worst phonetization errors, and on improving the maintainability of the system helping in database cultivation, and al...
Text-to-speech synthesis for Welsh and Welsh English
English sounds that had no equivalent in Welsh (equivalences derived from [4]). These three segments were: / /, / /, and / /. Tables 1 and 2 show the South Welsh phonemes, with their equivalents in RP and the extra English segments. This work represents the first known attempt to develop a text-to-speech synthesiser for Welsh. A list of pseudo-Welsh nonsense words was generated, allowing for ce...
KRAM—Extended Report
Term rewriting proved to be a simple, uniform and powerful computational paradigm. Rewrite rules independently match and apply anywhere, unconstrained by the context. Rewriting is particularly appealing for defining truly concurrent systems, since rewrite rules can apply in parallel. Unfortunately, there is an inherent impediment in using term rewriting for defining concurrent systems or progra...
Yazd Medical Interns' Knowledge on Legal Rules and Religious Orders in Medicine
Introduction: In addition to precise physical examination and objective observations, it is necessary for students and interns to be knowledgeable about the rules and religious orders in medicine. This study was performed to investigate Yazd interns' knowledge on religious orders and legal rules in medicine. Methods: In this descriptive study performed in educational health centers of Yazd, 12...
Journal title:
- Computer Speech & Language
Volume 8, Issue
Pages -
Publication date 1994